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SEQUENCE LISTING 
(1) GENERAL INFORMATION: j 


(i) APPLICANT: Schering Corporation 


(ii) TITLE OF INVENTION: 


Thioredoxin/ Heterologous Protein 
Bacterial Expression System 


(iii) NUMBER OF SEQUENCES: 14 

(iv) CORRESPONDENCE ADDRES: 

(A) ADDRESSEE: Schering-Plough Corporation 

(B) STREET: 2000 Galliping Hill Road 

(C) CITY: Kenilworth 

(D) STATE: New Jersey 

(E) COUNTRY: U.S.. 

(F) ZIP : 07033-0530 

(v) COMPUTER READABLE FOI 

(A) MEDIUM TYPE: (diskette 

(B) COMPUTER: Apj>le Macintosh 

(C) OPERATING SYSTEM: Macintosh 7.5.3 

(D) SOFTWARE: Microsoft Word 5.1a 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION jsTUMBER: 

(B) FILING DATE: , 

(vii) PRIOR APPLICATION JDATJ 

(A) APPLICATION NUMBER: US 60/011,606 

(B) FILING DATE: 30-APR-1996 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Thampde, Immac J.. 

(B) REGISTRATION NUMBER: 36,322 

(C) REFERENCE DOCKET NUMBER: JB0600Q 


(2) INFORMATION FOR SEQ ID JSjO:l 
(i) SEQUENCE CHARACTERIS r ICS: 

(A) LENGTH: 30 bases 

(B) TYPE: nucleic a :id 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA ( 
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(xi) SEQUENCE DESCRIPTION: SBQ ID NO:l 
C C TGTGGAGT TACATATGAG CGATAAAATT 30 

(2) INFORMATION FOR SEQ ID NO:2 
(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 47 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

GCACCCAACA TGCAAGGATC CTTACGCC AG ATTAGCATCG AGGAACT 

(2) INFORMATION FOR SEQ ID NO:3 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 336 base oairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3 

ATGAGCGATA AAATTATTCA CCTGACTGAC GACAGTTTTG ACACGGATGT 50 

ACTCAAAGCG GACGGGGCGA TCCTCGTCGA TTTCTGGGCA GAGTGGTGCG 100 

GTCCGTGCAA AATGATCGCC CCGATTC TGG ATGAAATCGC TGACGAATAT 150 

CAGGGCAAAC TGACCGTTGC AAAACTG AAC ATCGATCAAA ACCCTGGCAC 2 00 

TGCGC CGAAA TATGGCATCC GTGGTAT CCC GACTCTGCTG CTGTTCAAAA 2 50 

ACGGTGAAGT GGCGGCAACC AAAGTGOGTG CACTGTCTAA AGGTCAGTTG 3 00 
AAAGAGTTCC TCGATGCTAA TCTGGCOTAA GGATCC 

(2) INFORMATION FOR SEQ ID NO:4 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 336 bas*s pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
linear 


(D) TOPOLOGY: lined 
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(ii) MOLECULE TYPE:|cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4 


ATGAGCGATA 
ACTCAAAGCG 
GTCCGTGCAA 
CAGGGCAAAC 
TGCGCCGAAA 
ACGGTGAAGT 
AAAGAGTTCC 


AAATTATTC! 

GACGGGGCC 

AATGATCGC 

TGACCGT1 

TATGGCATC 

GGCGGCAAC 

TCGAGGCTi 


:a cctgactgac 
;a tcctcgtcga 
c ccgattctgg 
c aaaac tgaac 
c gtggtatccc 

!c AAAGTGGGTG 
TCTGGCGTAA 


GACAGTTTTG 
TTTCTGGGCA 
ATGAAATCGC 
ATCGATCAAA 
GACTCTGCTG 
CACTGTCTAA 
GGATCC 


ACACGGATGT 50 

GAGTGGTGCG 100 

TGACGAATAT 150 

ACCCTGGCAC 200 

CTGTTCAAAA 250 

AGGTCAGTTG 3 00 


(2) INFORMATION FOR SEQ ID NO:5 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 bases pairs 

(B) TYPE: kucleic acid 

(C) STRAIWDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5 


GAT AAT ATT CTG GCT 
Asp Asn Asn Leu Ala 

1 - 5 


3T TCT GGT TCT GGT GAT GAC GAT GAC AAG 45 
Ser Gly Ser Gly Asp Asp Asp Asp Lys 
10 15 


GGT CCT GTT CCG CCG TCT ACC GCT CTG CGT GAG CTC 
Gly Pro Val Pro Pro Sor Thr Ala Leu Arg Glu Leu 
20 25 


(2) INFORMATION FOR SEC 2 ID NO:6 
(i) SEQUENCE CHARAC rERISTlCS; 


27 amino acids 
o acid 


(A) LENGTH 

(B) TYPE: 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6 


Asp Asn Asn Leu Ala Gly I Ser Gly Ser Gly Asp Asp Asp Asp Lys 
1 5 I 10 15 


Gly Pro Val Pro Pro Ser [Thr Ala Leu Arg Glu Leu 
20 I 25 
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(2) INFORMATION FOR SEQ ID NO:7 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 bases 

(B) TYPE: nucleic acid 

(C) STRANlDEDNESS: single 

(D) TOPOI OGY: linear 
(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCR PTION: SEQ ID NO:7 


GAAGGAGGCT GATTAAATG 
TC 


GTCCGGTTCC GCCGTCTACC GCTCTGGAGC 50 


(2) INFORMATION FOR : >EQ ID NO:8 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRA1KTDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: 1 )NA 

(xi) SEQUENCE DESCEIPTION: SEQ ID NO:8 

AAGGAGGCTG ATMAATG 

(2) INFORMATION FOR ?EQ ID NO:9 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 bases 

(B) TYPE: nucleic acid 

(C) STRA MDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCEIPTION: SEQ ID NO:9 

AAGGAGGCTG ATT AATG 

(2) INFORMATION FOR SEQ ID NO:10 
(i) SEQUENCE CHAR^ lCTERISTICS: 

V 
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(A) LENGTH: 14 bases 

(B) TYPE: nlcleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION SEQ ID NO:10 

AAGGAGGTTT AATG 

(2) INFORMATION FOR SE<b ID NO:ll 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: am no acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: pep tide 
(xi) SEQUENCE DESCRIPTION SEQ ID NO: 11: 


Leu 
1 


Asp Ala 


Asn 


Leu 
5 


Ala 


(2) INFORMATION FOR SEQ ID NO:12 
(i) SEQUENCE CHARAC TERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: nu:leic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION SEQ ID NO:12 


CTC GAT 
Leu Asp 
1 


GCT 
Ala 


(2) INFORMATION FOR S^Q ID NO:13 
(i) SEQUENCE 

(A) LENGHjl: 

(B) TYPE: nucleic 

(C) STRANDEDNESS: single 


AAT CTG GCG 
Asn Leu Ala 
5 


TAA 


CHARACTERISTICS: 
: 21 bases 
ic acid 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: lj>NA 
(xi) SEQUENCE DESCFIPTION SEQ ID NO:13 


CTC GAG 
Leu Glu 
1 


Gcr 


ALi 


CHAR/ .CTERISUCS: 


LENGITi; 


(2) INFORMATION FOR $EQ ID NO:14 
(i) SEQUENCE 
(A) 

(B) TYPE: 
(D) TOPOLOGY 

(ii) MOLECULE TYPE: p]eptid 

(xi) SEQUENCE 


; DESCR PTION 


AAT CTG GCG 
Asn Leu Ala 
5 


TAA 


: 6 amino acids 
4mino acid 

: linear 
e 

SEQ ID NO:14 


Leu Glu Ale. 
1 


Asn Leu Ala 
5 


